Overview

Dataset Statistics

Number of Variables 28
Number of Rows 12980
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 12.0 MB
Average Row Size in Memory 973.0 B
Variable Types
  • Numerical: 11
  • Categorical: 17

Dataset Insights

Patient_Id is uniformly distributed Uniform
Pat_Pain_Score is skewed Skewed
ER_Visits is skewed Skewed
Glucose is skewed Skewed
Cost_Of_Initial_Stay is skewed Skewed
Gender has constant length 1 Constant Length
IP_Visits has constant length 1 Constant Length
Readmit30 has constant length 1 Constant Length
Pat_Pain_Score has 6282 (48.4%) zeros Zeros
ER_Visits has 5035 (38.79%) zeros Zeros

Variables


Patient_Id

numerical

Approximate Distinct Count 12980
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 7904.5389
Minimum 1
Maximum 15821
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Patient_Id is uniformly distributed
  • Patient_Id is skewed right (γ1 = 0.0009)

Quantile Statistics

Minimum 1
5-th Percentile 790.95
Q1 3988.75
Median 7895.5
Q3 11818.75
95-th Percentile 15032.05
Maximum 15821
Range 15820
IQR 7830

Descriptive Statistics

Mean 7904.5389
Standard Deviation 4556.0195
Variance 2.0757e+07
Sum 1.026e+08
Skewness 0.0008815
Kurtosis -1.1907
Coefficient of Variation 0.5764

Gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 856680

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row M
2nd row F
3rd row F
4th row M
5th row M

Letter

Count 12980
Lowercase Letter 0
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (F, M) take over 50.0%
  • Gender has words of constant length

Marital_Status

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 927727

Length

Mean 6.4736
Standard Deviation 0.4993
Median 6
Minimum 6
Maximum 7

Sample

1st row Married
2nd row Married
3rd row Married
4th row Married
5th row Married

Letter

Count 84027
Lowercase Letter 71047
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Single, Married) take over 50.0%

Insurance_Provider

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 951216
  • The largest value (Medicare) is over 5.6 times larger than the second largest value (Commercial)

Length

Mean 8.2832
Standard Deviation 0.6973
Median 8
Minimum 8
Maximum 10

Sample

1st row Medicare
2nd row Medicare
3rd row Medicare
4th row Medicare
5th row Medicare

Letter

Count 107516
Lowercase Letter 94536
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Medicare, Commercial) take over 50.0%
  • The largest value (medicare) is over 5.6 times larger than the second largest value (commercial)

Tobacco_User

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 898422
  • The largest value (Quit) is over 1.68 times larger than the second largest value (Never)

Length

Mean 4.2159
Standard Deviation 0.7657
Median 4
Minimum 3
Maximum 9

Sample

1st row Never
2nd row Quit
3rd row Yes
4th row Quit
5th row Never

Letter

Count 54639
Lowercase Letter 41576
Space Separator 83
Uppercase Letter 13063
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Quit, Never) take over 50.0%
  • The largest value (quit) is over 1.68 times larger than the second largest value (never)

Depression

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 870416
  • The largest value (No) is over 16.17 times larger than the second largest value (Yes)

Length

Mean 2.0582
Standard Deviation 0.2342
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row Yes
3rd row No
4th row Yes
5th row No

Letter

Count 26716
Lowercase Letter 13736
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 16.17 times larger than the second largest value (yes)

ICU

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 870274
  • The largest value (No) is over 20.14 times larger than the second largest value (Yes)

Length

Mean 2.0473
Standard Deviation 0.2123
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row Yes
4th row No
5th row No

Letter

Count 26574
Lowercase Letter 13594
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 20.14 times larger than the second largest value (yes)

Drug_Abuse

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 870146
  • The largest value (No) is over 25.71 times larger than the second largest value (Yes)

Length

Mean 2.0374
Standard Deviation 0.1899
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row Yes
4th row No
5th row No

Letter

Count 26446
Lowercase Letter 13466
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 25.71 times larger than the second largest value (yes)

Mood_Disorder

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 869816
  • The largest value (No) is over 82.21 times larger than the second largest value (Yes)

Length

Mean 2.012
Standard Deviation 0.109
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row Yes
3rd row No
4th row No
5th row No

Letter

Count 26116
Lowercase Letter 13136
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 82.21 times larger than the second largest value (yes)

Diabetes

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 869848
  • The largest value (No) is over 68.04 times larger than the second largest value (Yes)

Length

Mean 2.0145
Standard Deviation 0.1195
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 26148
Lowercase Letter 13168
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 68.04 times larger than the second largest value (yes)

Anxiety

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 872430
  • The largest value (No) is over 3.69 times larger than the second largest value (Yes)

Length

Mean 2.2134
Standard Deviation 0.4097
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row Yes
3rd row No
4th row Yes
5th row Yes

Letter

Count 28730
Lowercase Letter 15750
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 3.69 times larger than the second largest value (yes)

Obesity

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 870429
  • The largest value (No) is over 15.88 times larger than the second largest value (Yes)

Length

Mean 2.0592
Standard Deviation 0.2361
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 26729
Lowercase Letter 13749
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 15.88 times larger than the second largest value (yes)

Dementia

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 870531
  • The largest value (No) is over 13.9 times larger than the second largest value (Yes)

Length

Mean 2.0671
Standard Deviation 0.2502
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 26831
Lowercase Letter 13851
Space Separator 0
Uppercase Letter 12980
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 13.9 times larger than the second largest value (yes)

Age

numerical

Approximate Distinct Count 668
Approximate Unique (%) 5.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 71.6173
Minimum 18.1
Maximum 90
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Age is skewed left (γ1 = -0.9911)

Quantile Statistics

Minimum 18.1
5-th Percentile 45.3
Q1 63.4
Median 74.5
Q3 82.9
95-th Percentile 88.3
Maximum 90
Range 71.9
IQR 19.5

Descriptive Statistics

Mean 71.6173
Standard Deviation 13.8549
Variance 191.9577
Sum 929592.3
Skewness -0.9911
Kurtosis 0.7737
Coefficient of Variation 0.1935
  • Age is not normally distributed (p-value 0.002550820710649024)
  • Age has 237 outliers

Bmi

numerical

Approximate Distinct Count 3382
Approximate Unique (%) 26.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 30.3375
Minimum 14
Maximum 54.47
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Bmi is skewed right (γ1 = 0.7924)

Quantile Statistics

Minimum 14
5-th Percentile 19.0395
Q1 24.16
Median 28.81
Q3 35.1225
95-th Percentile 47.4605
Maximum 54.47
Range 40.47
IQR 10.9625

Descriptive Statistics

Mean 30.3375
Standard Deviation 8.4656
Variance 71.667
Sum 393780.6648
Skewness 0.7924
Kurtosis 0.2866
Coefficient of Variation 0.279
  • Bmi is not normally distributed (p-value 0.001125268549574447)
  • Bmi has 350 outliers

Weight

numerical

Approximate Distinct Count 325
Approximate Unique (%) 2.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 190.9861
Minimum 42
Maximum 392
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Weight is skewed right (γ1 = 0.7809)

Quantile Statistics

Minimum 42
5-th Percentile 112
Q1 149
Median 182
Q3 225
95-th Percentile 301
Maximum 392
Range 350
IQR 76

Descriptive Statistics

Mean 190.9861
Standard Deviation 58.1041
Variance 3376.0889
Sum 2.479e+06
Skewness 0.7809
Kurtosis 0.5289
Coefficient of Variation 0.3042
  • Weight is not normally distributed (p-value 0.002129271133751218)
  • Weight has 268 outliers

Height

numerical

Approximate Distinct Count 34
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 66.3621
Minimum 40
Maximum 76
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Height is skewed left (γ1 = -0.097)

Quantile Statistics

Minimum 40
5-th Percentile 60
Q1 63
Median 66
Q3 70
95-th Percentile 73
Maximum 76
Range 36
IQR 7

Descriptive Statistics

Mean 66.3621
Standard Deviation 4.3018
Variance 18.5058
Sum 861380
Skewness -0.09699
Kurtosis -0.08913
Coefficient of Variation 0.06482
  • Height is not normally distributed (p-value 0.006547615646077652)
  • Height has 19 outliers

Pulse

numerical

Approximate Distinct Count 96
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 78.6753
Minimum 48
Maximum 145
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Pulse is skewed right (γ1 = 0.6643)

Quantile Statistics

Minimum 48
5-th Percentile 58
Q1 68
Median 77
Q3 88
95-th Percentile 106
Maximum 145
Range 97
IQR 20

Descriptive Statistics

Mean 78.6753
Standard Deviation 14.869
Variance 221.0884
Sum 1.0212e+06
Skewness 0.6643
Kurtosis 0.5316
Coefficient of Variation 0.189
  • Pulse is not normally distributed (p-value 0.0043360922523468645)
  • Pulse has 151 outliers

Temperature

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 869786
  • The largest value (98) is over 2.2 times larger than the second largest value (97)

Length

Mean 2.0097
Standard Deviation 0.09805
Median 2
Minimum 2
Maximum 3

Sample

1st row 99
2nd row 98
3rd row 98
4th row 99
5th row 98

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 26086
  • The top 2 categories (98, 97) take over 50.0%
  • The largest value (98) is over 2.2 times larger than the second largest value (97)

Pat_Pain_Score

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 1.9499
Minimum 0
Maximum 10
Zeros 6282
Zeros (%) 48.4%
Negatives 0
Negatives (%) 0.0%
  • Pat_Pain_Score is skewed right (γ1 = 1.2564)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 1
Q3 3
95-th Percentile 8
Maximum 10
Range 10
IQR 3

Descriptive Statistics

Mean 1.9499
Standard Deviation 2.5708
Variance 6.6089
Sum 25310
Skewness 1.2564
Kurtosis 0.5488
Coefficient of Variation 1.3184
  • Pat_Pain_Score is not normally distributed (p-value 3.8821376510683216e-22)
  • Pat_Pain_Score has 678 outliers

ER_Visits

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 1.6991
Minimum 0
Maximum 13
Zeros 5035
Zeros (%) 38.8%
Negatives 0
Negatives (%) 0.0%
  • ER_Visits is skewed right (γ1 = 1.9494)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 1
Q3 2
95-th Percentile 6
Maximum 13
Range 13
IQR 2

Descriptive Statistics

Mean 1.6991
Standard Deviation 2.2136
Variance 4.9001
Sum 22054
Skewness 1.9494
Kurtosis 4.4677
Coefficient of Variation 1.3028
  • ER_Visits is not normally distributed (p-value 8.870801893510916e-18)
  • ER_Visits has 886 outliers

IP_Visits

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 856680
  • The largest value (0) is over 10.62 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 12980
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 10.62 times larger than the second largest value (1)
  • IP_Visits has words of constant length

Chronic_Conditions

numerical

Approximate Distinct Count 27
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 6.6124
Minimum 1
Maximum 27
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Chronic_Conditions is skewed right (γ1 = 1.3241)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 3
Median 6
Q3 9
95-th Percentile 17
Maximum 27
Range 26
IQR 6

Descriptive Statistics

Mean 6.6124
Standard Deviation 4.8518
Variance 23.5399
Sum 85829
Skewness 1.3241
Kurtosis 1.908
Coefficient of Variation 0.7337
  • Chronic_Conditions is not normally distributed (p-value 8.229887661233916e-05)
  • Chronic_Conditions has 426 outliers

Glucose

numerical

Approximate Distinct Count 356
Approximate Unique (%) 2.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 144.6345
Minimum 58
Maximum 423
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Glucose is skewed right (γ1 = 1.5144)

Quantile Statistics

Minimum 58
5-th Percentile 82
Q1 96
Median 127
Q3 172
95-th Percentile 281
Maximum 423
Range 365
IQR 76

Descriptive Statistics

Mean 144.6345
Standard Deviation 63.8327
Variance 4074.6175
Sum 1.8774e+06
Skewness 1.5144
Kurtosis 2.3194
Coefficient of Variation 0.4413
  • Glucose is not normally distributed (p-value 8.238953864841001e-09)
  • Glucose has 589 outliers

Condition

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 987616

Length

Mean 11.0875
Standard Deviation 1.9982
Median 13
Minimum 9
Maximum 13

Sample

1st row Pneumonia
2nd row Pneumonia
3rd row Pneumonia
4th row Pneumonia
5th row Heart_Failure

Letter

Count 137142
Lowercase Letter 117388
Space Separator 0
Uppercase Letter 19754
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Heart_Failure, Pneumonia) take over 50.0%

Care_Plan_Following_Discharge

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1044254

Length

Mean 15.451
Standard Deviation 5.6747
Median 18
Minimum 7
Maximum 24

Sample

1st row Expired
2nd row Telehealth
3rd row Skilled Nursing Fa...
4th row Telehealth
5th row Telehealth

Letter

Count 183782
Lowercase Letter 158024
Space Separator 16772
Uppercase Letter 25758
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Discharged to Home, Home Health) take over 50.0%
  • The largest value (home) is over 1.83 times larger than the second largest value (discharged)

Cost_Of_Initial_Stay

numerical

Approximate Distinct Count 12485
Approximate Unique (%) 96.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 207680
Mean 7268.5442
Minimum 9.4
Maximum 117843.12
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Cost_Of_Initial_Stay is skewed right (γ1 = 4.0754)

Quantile Statistics

Minimum 9.4
5-th Percentile 2199.094
Q1 3722.6475
Median 5579.405
Q3 8616.0425
95-th Percentile 17905.5795
Maximum 117843.12
Range 117833.72
IQR 4893.395

Descriptive Statistics

Mean 7268.5442
Standard Deviation 6172.1291
Variance 3.8095e+07
Sum 9.4346e+07
Skewness 4.0754
Kurtosis 31.4452
Coefficient of Variation 0.8492
  • Cost_Of_Initial_Stay is not normally distributed (p-value 4.942766434576092e-16)
  • Cost_Of_Initial_Stay has 871 outliers

Readmit30

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 856680
  • The largest value (0) is over 4.2 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 12980
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 4.2 times larger than the second largest value (1)
  • Readmit30 has words of constant length

Interactions

Correlations

Missing Values